|
|
Accession Number |
TCMCG041C23581 |
gbkey |
CDS |
Protein Id |
XP_019054912.1 |
Location |
complement(join(77980..78162,78514..78599,78716..78788,78885..78944,79226..79274,79896..79996,80130..80183,80286..80330,82469..82666,82761..82834,99065..99177,99262..99355,99439..100965,101371..101440,102912..103132,103233..103248,103325..103490,103571..103663,103806..103863,105112..105169,105301..105455,119029..119076,119204..119255,142714..142839)) |
Gene |
LOC104606810 |
GeneID |
104606810 |
Organism |
Nelumbo nucifera |
|
|
Length |
1239aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA264089 |
db_source |
XM_019199367.1
|
Definition |
PREDICTED: DNA mismatch repair protein MLH3 isoform X2 [Nelumbo nucifera] |
CDS: ATGAAGAGCATTAAGCACTTGCCAAAGGGTGTCCATAGTTCACTGCGCTCAAGTGTCATTCTTTTTGACTTGACAAGGGTTGTGGAAGAATTGATATTTAATAGTTTGGACGCTGGTGCAACAAAGATAACTGTATCTATAGGTGTTGGGACAAGCTATGTAAAAGTAGAGGATGATGGATCTGGAATTACTCGTGATGGATTGGTGCTGTTGGGAGAAAGAAATGCGACATCAAAGCTTCCCAGTTTGGCAGAAATTGATGTTTCTATGGGAAGTTATGGTTTTCAAGGAGAAGCATTGGGATCCCTATCTGATATCTCCTTGTTAGAAATCATTACAAAGGCTCGTGGCAGGCCAAGTGGATATCGCAAGGTTATAAAGGGATGCAAGTGTTTGTATCTTGGACTTGATGAGAGCAGACAAGATGTAGGTACAACAGTGATTGTTCGTGATTTATTTTATAACCAACCTGTTCGGAGAAAGTATATGCATTCCAGCCCTAAGAAGGTCTTGCACTCAGTCAAAAAATGTGTTTTGCGAATTGCACTTGTGCACCCACAGGTTTTCTTCAAAGTCATTGATATTGAGAGTGAGGATGAGTTGCTCTGCACACATTCTTCCTTGTCGCCATTGTCATTACTGTTGAACAGTTTTGGCTCTGAAATTTCTAGCTGTCTACACAAACTGAATTTTTCTCAGGGTGTATTGAAGCTTTCTGGATATTTATCCGGTCTTGGTGAGATTTGCTCAACAAAGGCATACCAGTATGTCTACATCAACTCACGGTTTATTTGCAAGGGTCCAATTCATAAACTGCTTAAAGATGTGGCAGATTCTTATATGTGTCTGGATCTGTGGAAAGGTAGTTCTGGGTCCCAAAATGGGAAAAGAAATAGGCCACAGACATATCCAACCTATATCCTAAATTTTTGTTGTCCACGCTCCAGCTATGACTTGACCTTTGAACCATCAAAAACATTTGTTGAATTCAAGGACTGGGCTCCTATACTTTCCTTTATTGAACAAGCTGTCAGACATTTTTGGAGTCAAATCTCAGTACAAGATATATCAGCATGCTCTGAAATTGTGGAAAAGAAGTGCAAGGTCAAATATTATCATAAATCTTTGAACCACAATTGTTCCTCTATGGAATTTGCATCTGAAGAGGCCAACTGTTATGAACAGAAAAAACATAAAATGGCCTCCAAAAAATTACAAAGAAATACTGCAGAAGTCAAAGGACAAAATGTAAAGGCAGAATATGTACCCTCAAGTTATCATTCTTTGCAGGACATGGTTTCCAATTCATGTGATCCTTCCATAACCAAATCTATACCAGTAGTTAATCAAGAGAATGGTGATGATCTTTTATGTGTAAAATGTAATGCTTTGACAGAAAGATTGGCTGCTTCACAGACTGCCACAGATGATGTAAAAAAATTTATTTTAGGACATAAGCAGGGAAATGAATCTCTCAAAGTTGATGTCATGGGTGAAGAATCAACAAGAACCCTGTTAACATGCAGTGATTTTGAATACAGAAATGATGTAGAAAGGGTTTCATTTCCTTCAGGATGCACCAGAGATTTTGAGAAGTCCGTTTTGCTGCGTTGTCCATCACTGCAAAGTGGACCTTATGATGCATCACTGTCTGTGAATCATGAAGAATTTGAATTTCACATGGATGAACTCTGTTCCAAAAGAACATTACCTGTTCTTCGAGACATGATTGCTGTTGTGGATACTGATGATGGTAACGCGAGTTCTGAGTTCTTTGCAGAAGCTTCATGGCGGGACAATGCAAGTGATTCTCCTCTCTCTTTTAATAGTGTTACAAAATGCAGCATACATACAGAACTAGATGGCTTGTCAGGGAGCTTCATGAAATCACATCCTTCTGAAAGGGATTATTTCACTGAACAGAGTAACTTCCAAAATAATACCCTTGCAAGGTTCAGGAAAATAGGTTCCGGTCATTGTTCAACAGATTTTAATTGGTATTCTGAGTCCCCATACTTGAAGATCAGGAATCCATCAGAGAATCTGGAACATTTTAATGATAAATATGTTGCTGAACTCAATTGCCGGTCCCGTGGAAGTGATACTTCTTGGAAGTTTAGAGAAAGGAAAGACAAACTTGACTTTGGTTATGATACTAATAATGTCACTGGTGGAGATTACCTTTCCTTGAACGCTGCAAATACTGCAGTAAATGATCAAACATTTCTCTGTCATGAGCAATGTTTAGATGATATTATCTTTGAACGAAGTGCTTGTTCTGATAAATTAACTAATGGAAAGGATTGGTTATGCTTAGACTCTTTTGATATGGAAACAGCAGATAGTTGTTCAGAACAAGTATTTCACATTCCTTCACCTAATGATTACAATAGGGGAAACAATCCAAGAAATCACTTGGGGTCTAGAAGTCATATGTACTATCAAGTTCTTAAGAAAAGATCAAAGAGAAGCCTTTCAGCCCCACCATTTTATAAAGGCAAGAAAAAGTTACATTCTATACAAAACAAATTGAGAACCACTGCAGGAGAAGGTGAAGAACAGATCATCCATAAGGCCTCTACTTTACCAGAAAGAAAGCAATTTGAGCATCCATCGCATTCTTGCCATATGTCTCACCAATATTTTGAACAGAATCTAGTTGATGATTCATTGTACTTTTCAAGAACTCACATGGAGGATAGGCCACATGACAGACAGTATATGATTGATGTCCAAGAGAGTGATGACTTCAGGAAACCTAAATATTTTGAGATGTACAATACAGATTTAGTCGAAGACTTCAATCCTGTTGATATGGAAGATCCAAAACTTTCCTGGGTCAAGTGGCAAGATGGCAATTCACAGGCTCCAGATGATGATGCGCCAGAGAAACTGCATGATCCAAATGATATACTTGATATCTTGTCTGGGATCTTGCACCTTACTGGTGATTCTTTAGTTCCCAAATCTATCAATAAAGATTGCCTTGAAGATGCCAGGGTTCTCCTACAACTGGACAAAAAATTTATCCCTGTCATAGCAGGTGGAACACTAGCTATCATTGACCAGCATGCTGCAGATGAAAGGATCCGACTAGAAGAACTGCGTCGGAAGGTATTGTCTGGTGAAGGAAGAACAGTTGCTTATCTGGATTCTGAGCAAGAGTTGGTGCTACCTGAGATTGGATATCAGCTACTGCACAACTACACAGAACAGATAAATAATTGGGGTTGGATTTACAATAATCAGGTTTCCGGGTCCTTCACCAAGAACTTGAATGTGCTAAATAGGCGGACTGCTACTGTCACACTTATTGCGGTACCTTGCATTTTAGGTGTCAAGTTGTCTGACAAAGACCTTGTAGAATTTCTTGAACAGCTTGCTGAGACAGATGGATCATCAGCTATGCCACCATCTGTTCTTCGAATCCTTTGTTTTAAAGCATGCAGAGGTGCAATCATGTTCGGGGACTCATTGCTACCTTCAGAGTGTTCATTAATTGTTGAAGAACTAAAGCAGACTTCGTTATGCTTTCAGTGCGCTCATGGGCGGCCAACTACTGCTCCTCTCGTCAACTTGGAGACATTGCATAAGCAGATTTCCCAGCTTCAGTTGTTTCATGGGGGTTCAAATGAGCAGTGGCATGGGTTGCAACGGCATCAACCAAGCCTAGAACGTGCATCACAGCGTTTAAACTCAACCAGAGATAATTTTGGGTGA |
Protein: MKSIKHLPKGVHSSLRSSVILFDLTRVVEELIFNSLDAGATKITVSIGVGTSYVKVEDDGSGITRDGLVLLGERNATSKLPSLAEIDVSMGSYGFQGEALGSLSDISLLEIITKARGRPSGYRKVIKGCKCLYLGLDESRQDVGTTVIVRDLFYNQPVRRKYMHSSPKKVLHSVKKCVLRIALVHPQVFFKVIDIESEDELLCTHSSLSPLSLLLNSFGSEISSCLHKLNFSQGVLKLSGYLSGLGEICSTKAYQYVYINSRFICKGPIHKLLKDVADSYMCLDLWKGSSGSQNGKRNRPQTYPTYILNFCCPRSSYDLTFEPSKTFVEFKDWAPILSFIEQAVRHFWSQISVQDISACSEIVEKKCKVKYYHKSLNHNCSSMEFASEEANCYEQKKHKMASKKLQRNTAEVKGQNVKAEYVPSSYHSLQDMVSNSCDPSITKSIPVVNQENGDDLLCVKCNALTERLAASQTATDDVKKFILGHKQGNESLKVDVMGEESTRTLLTCSDFEYRNDVERVSFPSGCTRDFEKSVLLRCPSLQSGPYDASLSVNHEEFEFHMDELCSKRTLPVLRDMIAVVDTDDGNASSEFFAEASWRDNASDSPLSFNSVTKCSIHTELDGLSGSFMKSHPSERDYFTEQSNFQNNTLARFRKIGSGHCSTDFNWYSESPYLKIRNPSENLEHFNDKYVAELNCRSRGSDTSWKFRERKDKLDFGYDTNNVTGGDYLSLNAANTAVNDQTFLCHEQCLDDIIFERSACSDKLTNGKDWLCLDSFDMETADSCSEQVFHIPSPNDYNRGNNPRNHLGSRSHMYYQVLKKRSKRSLSAPPFYKGKKKLHSIQNKLRTTAGEGEEQIIHKASTLPERKQFEHPSHSCHMSHQYFEQNLVDDSLYFSRTHMEDRPHDRQYMIDVQESDDFRKPKYFEMYNTDLVEDFNPVDMEDPKLSWVKWQDGNSQAPDDDAPEKLHDPNDILDILSGILHLTGDSLVPKSINKDCLEDARVLLQLDKKFIPVIAGGTLAIIDQHAADERIRLEELRRKVLSGEGRTVAYLDSEQELVLPEIGYQLLHNYTEQINNWGWIYNNQVSGSFTKNLNVLNRRTATVTLIAVPCILGVKLSDKDLVEFLEQLAETDGSSAMPPSVLRILCFKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGRPTTAPLVNLETLHKQISQLQLFHGGSNEQWHGLQRHQPSLERASQRLNSTRDNFG |